Information item morphing system

ABSTRACT

The invention relates to a method of generating sequencing information representing a sequence of items selected in a database. Each of the items comprises a set of descriptors. The method is characterised by comprising the steps of: a) specifying a length of items and at least one of the descriptors; b) performing similarity relation techniques between the items; and c) generating a fixed-length sequence having a morphological continuity. Each of the items is represented by a series of constraint variables having a domain in the database, and modelling each of the values of descriptors in a desired sequence as a constrained variable. The similarity relation performing step may comprise performing global similarity techniques by combining individual similarity measures on all of the descriptors. The sequence-generating step comprises transforming at least one of the values into unary constraints in terms of constraint satisfaction programming techniques.

DESCRIPTION

[0001] The present invention relates to a system that produces fixed-length sequences of items out of a database, such as music title programmes from a music catalogue. A sequence thus produced has to comply with partial information specified by a user, and to be “continuous” in terms of “morphologies” of the items generated in the sequence. This partial information may be, for example, a first title and a last title in the sequence of items (simple case), or part of “morphological” information on any particular item in the sequence. In the musical field, this information may be a musical descriptor such as genre, a type of rhythm, the tempo, etc. The “continuity” between the items is achieved through a similarity relationship. This relationship is defined as a combination of individual similarity measures described for each possible descriptor value.

[0002] Advances in networking and transmission of digital multimedia data has provided users with a huge number of information catalogues, such as music catalogues. These advances thus raise not only the problem of distribution, but also the problem of choosing desired information among huge catalogues.

[0003] Such new developments raise music selection problems which may depend on the aims of users or content providers. Although modelling a user's goal in accessing music is very complex, two basic elements, i.e. desire of repetition and desire of variation or surprise, can be identified.

[0004] The desire of repetition means that people want to listen to music they already know, or similar to what they already know. Sequences of repeating notes create expectations of the same notes to occur. On the other hand, the desire for variation or surprise is a key to understanding music at all levels of perception.

[0005] Of course, these two desires are contradictory, and the issue in music selection is precisely to find the right compromise: provide users with items they already know, or items they do not know but would probably like.

[0006] From the viewpoint of record companies, the goal of music delivery is to achieve a better exploitation of the catalogue. Indeed, record companies have problems with the exploitation of their catalogues using standard distribution schemes. For technical reasons, only a small part of a catalogue is actually “active”, i.e. proposed to users, in the form of easily available products. More importantly, the analysis of music sales shows clearly decreases in the sales of albums, and short-term policies based on selling many copies of a limited number of items (hits) are no longer efficient. Additionally, the sales of general-purpose “samplers” (e.g. “Best of love songs”) are no longer profitable, because users already have the hits, and do not want to buy CDs in which they like only a fraction of the titles. Instead of proposing a small number of hits to a large audience, a natural solution is to increase diversity, by proposing more customised albums to users.

[0007] In the present invention, the term “database” is used for designating any collection of data, e.g. covering both pre-stored data and dynamically stored data. The term “metabase” is used to describe a database containing descriptors of the items in the database. There are many situations in which it is necessary or desirable to create a sequence of items (e.g. music titles) from a collection of items for which data are available. It is also important that a created sequence is “coherent”, i.e. there should exist a particular relationship between descriptors of the items which constitute a sequence. Typically, the descriptors of the items, components of the sequence, should not be too dissimilar, especially for successive items in the same sequence. A typical case where the problem supra arises is in the field of multimedia. A notable problem concerns an automatic generation of music programs, the latter being an example of temporal sequence. Here the term “program” is used not only to designate a sequence of musical pieces, but also, more generally, any temporal sequence of multimedia items, e.g. film clips, documentaries, texts.

[0008] A system producing “coherent” sequences of items in a particular order is disclosed in European patent application EP-A-0 961 209.

[0009] The items descriptors are stored in a metabase and consist of data pairs respectively consisting of a descriptor and a corresponding value. The problem of creating the desired sequence is treated as a “Constraint Satisfaction Programming (CSP)”, also disclosed in the European patent application supra. The sequence to be obtained is specified by formulating a collection of constraints holding on items in the metabase. Each constraint describes a particular property of the sequence, and the sequence can be specified by any number of constraints.

[0010] The items in the metabase exhibit a particular generic format with associated taxonomies for at least some of the descriptors. Also, the constraints are specified out of a predetermined library of generic constraint classes which have been specially formulated. The special constraint classes allow the expression of desired properties of the target sequence, notably properties of similarity between groups of items, properties of dissimilarity and properties of cardinality. These constraint classes enable the properties of coherent sequences to be expressed in a particularly simple manner.

[0011] It is the combination of the use of a generic format for items in the data base and the special constraint classes which enables the use of a CSP solution technique to solve the combinatorial problem of building an ordered collection of elements satisfying a number of constraints.

[0012] It is an object of the present invention to provide a system which enables users to produce a fixed-length sequence of items out of a database by specifying only partial information. The main innovation of the invention is 1) the introduction of a special class of constraint, namely the global continuity constraint which allows to compute a “morphing” between two titles and 2) the possibility of specifying partial information about arbitrary titles in the sequence to be produced.

[0013] To this end, there is provided a method of generating sequencing information representing a sequence of items selected in a database, each of the items comprising a set of descriptors. The method comprises the steps of:

[0014] a) specifying a length of the sequence and at least one of the descriptors;

[0015] b) applying similarity relation techniques between the items; and

[0016] c) generating a fixed-length sequence having a morphological continuity.

[0017] In the above method, each of the items may be represented by a series of constraint variables having a domain in the database.

[0018] Further, the above-mentioned similarity-relation applying step may comprise modelling each of the descriptors in a desired sequence as a constrained variable.

[0019] Further yet, the similarity-relation applying step may comprise applying a global similarity relation technique by combining individual similarity measures on all of the descriptors.

[0020] Typically, the similarity-relation applying step comprises providing mathematical similarity functions.

[0021] Suitably, the similarity-relation applying step comprises providing similarity relations defined by given thresholds.

[0022] In the above method, the sequence-generating step may comprise transforming the at least one of the values into unary constraints in terms of constraint satisfaction programming techniques.

[0023] Suitably, the above sequence-generating step further comprises subjecting the unary constraints to a processing of variables domain reduction.

[0024] In the above method, the descriptors are preferably expressed in terms of descriptor/value pairs respectively, and each of the values for the descriptor is selected from descriptor/value lists.

[0025] Further, each of the descriptors may be associated to a descriptor type.

[0026] Preferably, the above descriptor type comprises at least one type selected from the group consisting of Integer-Type, Taxonomy-Type and Discrete-Type.

[0027] In the above-described method, the step of specifying at least one of said values may comprise specifying a first title and a last title of the items in the sequence.

[0028] Further, the step of specifying at least one of the values may comprise specifying a morphological style of the items in the sequence.

[0029] In a typical case, the above database comprises musical pieces, and the values comprise titles, and the titles form a music program.

[0030] The present invention further provides a system adapted to implement one of the above-described methods, which system comprises a general-purpose computer and a monitor for display of the generated information.

[0031] The invention also relates to a computer program product adapted to carry out one of the above-mentioned methods, when it is loaded into a general purpose computer.

[0032] The above and other objects, features and advantages of the present invention will be made apparent from the following description of the preferred embodiments, given as non-limiting examples, with reference to the accompanying drawings, in which:

[0033]FIG. 1 illustrates a taxonomy of musical styles, in which links indicate a similarity relation between styles. Here, “Jazz-Crooner” is represented as similar to “Soul-Blues”;

[0034]FIG. 2 illustrates an example of music programs defined by descriptors;

[0035]FIG. 3 shows the general data flow according to the concept of the present invention; and

[0036]FIG. 4 illustrates an example of a user interface for specifying partial information on the descriptors for a sequence of length 8.

[0037] In the preferred embodiments, the invention is applied to the automatic composition of musical programmes, for example for radio, set-top boxes, etc.

[0038] The description of the preferred embodiments of the invention will begin with an explanation of the constitutive elements, on the basis of which the present invention is implemented.

[0039] The present invention therefore uses constraint satisfaction programming techniques already described in European patent application EP-A-0 961 209, whose corresponding part is herewith expressly incorporated by reference in its entirety.

[0040] In the technical field of the present invention, and more particularly in the musical field, applications targeted at non professionals have also been developed using “RecitalComposer”, an embodiment of the previous patent application. “PathBuilder” is an application in which the user can specify a starting title and an ending title. The system contains hidden constraints on continuity of styles, and tempos are fixed. For instance: find a continuous path between Céline Dion's “All by myself”, and Michael Jackson's “Beat it”. Another similar application allows users to specify only the stylistic structure of the program. This may be used for instance for creating long programs for parties, in which the structure (e.g. begin with Pop, then Rock, then Slows, etc.) is known in advance.

[0041] Such an approach can be used to produce music programs in specific styles, by adding domain specific constraints. Other applications are envisaged for set-top-box services and digital audio broadcasting.

[0042] As can be understood from the foregoing, “RecitalComposer” is an enabling technology for building high-level music delivery services. The system is based on the idea of creating explicit sequences of items, specified by their global properties, rather than on computing sets of items satisfying queries. One of its main advantages over other approaches is that it produces ready-for-use music programs which satisfy the goals of music selection: repetition, surprise, and exploitation of catalogues.

EXAMPLES OF THE INVENTION

[0043] The present invention is described hereinafter with reference to the sequences containing music titles. The invention relates to a method of specifying or describing some or the entirety of the descriptors of the titles in a sequence, thereby automatically generating distance measuring, and performing so called “morphing” between two or more items, e.g. music titles.

[0044] The present invention has the following features:

[0045] 1) it allows the user to specify partial descriptions of titles (and not only a “specific description”, i.e. entirely specified title); and

[0046] 2) it allows the user to specify descriptions for “arbitrary” titles in the sequence (and not only the first or ending title).

[0047] 1. Metabase

[0048] The invention assumes the existence of a database of metadata, referred to as a metabase. The metabase of items, e.g. music titles, contains content information needed for specifying the constraints.

[0049] Each item is described in terms of descriptors which take their value in a predefined taxonomy. The descriptors are of two sorts: technical descriptors (descriptors) and content descriptors (values). Technical descriptors include the name of the title (e.g. name of a song), the name of the author (e.g. singer's name), the duration (e.g. “279 sec”), and the recording label (e.g. “Epic”). Content descriptors describe musical properties of individual titles. The descriptors may be the following: “style” (e.g. “Jazz Crooner”), “type of voice” (e.g. “muffled”), “music setup” (e.g. “instrumental”), “type of instruments” (e.g. “brass”), “tempo” (e.g. “slow-fast”), and other optional descriptors such as the “type of melody” (e.g. “consonant”), or the main “theme” of the lyrics (e.g. “love”).

[0050] No assumptions are made as regards how the metabase is created. Some of the descriptors may be entered by hand, others extracted automatically, such as the tempo (see e.g. Scheirer, E. D., J. of the Acoustical Society of America, 103 (1), 588-601, 1998), or the rhythm structure (see previous patent application EP 00 401 915.4).

[0051] Although the invention is largely independent of the actual structure and content of the metadatabase, an example of such a metadatabase is given hereinafter. Typically, the descriptors include the following:

[0052] Title name

[0053] Author

[0054] Style

[0055] Tempo

[0056] Energy

[0057] VoiceType

[0058] MainInstrument

[0059] RhythmType

[0060] The possible values for each of these descriptors are taken from descriptor-value lists. Each descriptor is associated to a “Descriptor-Type”. For instance, the Tempo descriptor is of Integer-Type (its value is an integer). The Style descriptor is of type “Taxonomy-Type”. The MainInstrument descriptor is of type “DiscreteDescriptor”, i.e. can take its value in a finite set of discrete values.

[0061] 2. Taxonomies of Values and Similarity Relations

[0062] An important aspect of the metabase is that the values of content descriptors are linked to each other by similarity relations. These similarity relations are used for specifying constraints on the continuity of the sequence (e.g., the preceding example contains a constraint on the continuity of styles). More generally, the taxonomies on descriptor values establish links of partial similarity between items, according to a specific dimension of musical content.

[0063] For all descriptor types, there is supposed the existence of a similarity relation similarity_X. This relation indicates whether a value for a given descriptor is “similar” to another value. For instance, the Style descriptor takes its value in a taxonomy of styles, in which the similarity relation is explicitly present (e.g. style_value=“Disco:US” could be explicitly stated as similar to style_value=“Disco:Philadelphia”). Other descriptors can have mathematical similarity functions. For instance, the tempo descriptor ranges over integers, on which similarity relations are defined using thresholds: similar-tempo(a, b) if |b−a|<threshold.

[0064] 3. Input/Output

[0065] The present embodiment of the invention uses, as input,:

[0066] (1) a sequence length n>1, and

[0067] (2) a limited number of descriptors (partial descriptors) for each item of the sequence.

[0068] It gives, as output, a sequence of length n, which satisfies a set of conditions defined below.

[0069] The “partial descriptors” are of the following form. For each title t_(i) of the sequence (1<=i<=n, where n is the length of the sequence), any number of descriptors is given a possible value, including “non specified”.

[0070]FIG. 4 shows an example of a user interface for specifying this partial information, for a sequence of length 8.

[0071] Once specified, the sequence is computed so that two conditions are satisfied:

[0072] (i) titles of the sequence satisfy the corresponding partial specifications (when they exist, i.e. when they are different from “non specified”);

[0073] (ii) titles are linked to each other by a global similarity relation SIM, defined below.

[0074] 4. Algorithm

[0075] (A). Similarity Relation

[0076] The algorithm uses a global similarity function SIM defined between two music titles. This function is a Boolean function (yields a yes/no answer). The function SIM can be defined in various ways. But in the present invention, all cases are defined from the individual similarity relations on each descriptor's “similarity-X” (see above).

[0077] The algorithm of the invention can in principle cope with any function SIM defined from similarity-X relations. In most cases though, the SIM function is defined as a logical combination of individual similarity-X relations. For instance, the SIM function can be defined as follows:

[0078] (B). Definition of the SIM Function

[0079] The number of descriptors of t₁ which are similar to the corresponding descriptors of t₂ is less than 1.

[0080] In pseudo-code:

[0081] SIM (t₁, t₂)=

[0082] CPT=0;

[0083] FOR I=1 to Max-Descriptor

[0084] if Similarity-_(i)(t₁, t₂)=false, then CPT:=CPT+1;

[0085] end FOR

[0086] return CPT<=1

[0087] (C). Description of the Algorithm

[0088] (1). General aspect

[0089] The invention makes use of a constraint solver system, as in previous patent application EP 0 961 209.

[0090] The sequence generation problem is represented as a constraint satisfaction problem:

[0091] the constrained variables are each title of the sequence t_(i).

[0092] The domain of these variables is a database of titles.

[0093] the constraints are the following:

[0094] i) Each “partial” specification given by the user is transformed into unary constraints, which are themselves transformed into variable domain reduction in a straightforward fashion. For instance, if the user wants the style of the third title to be “Jazz”, then only items satisfying this constraint will be retained in the domain of t₃.

[0095] ii) The global similarity relation SIM is represented as a binary constraint established systematically between all pairs of contiguous title variables (i.e. between t_(i) and t_(i+1), for i=1 to n−1).

[0096] A constraint satisfaction algorithm is applied using an arc-consistency technique for the similarity constraints, defined as follows:

[0097] The main specific aspect of this algorithm is the implementation of the filtering procedure for the binary SIM constraint. This constraint is implemented as follows.

[0098] (2) Description of the similarity constraint

[0099] (i) Notations

[0100] We present here the set of notations used in this section of the document.

[0101] The symbol ˜ represents the similarity between descriptor values in their respective taxonomies

[0102] We use uppercase for variables, e.g. X, Y

[0103] We use lowercase for values, e.g. x, y

[0104] Dom(X) denotes the domain of variable X

[0105] C(X, Y) denotes a constraint involving two constrained variables X and Y

[0106] Constraint C(X, Y) is defined by a formula of satisfaction (intentional definition), e.g. if C(X, Y) is an equality constraint, it will be defined by:

C(X, Y) (x, y)=1 iff x=y

[0107] We systematically identify 1 with True and 0 with False

[0108] (ii) Preliminary

[0109] In our approach, we implement similarity constraints by means of simpler constraints: counting and descriptor constraints. Counting constraints are used to evaluate the number of differences between the descriptor values of two titles. Descriptor constraints are used, in conjunction with descriptor variables, to represent descriptor values of titles as constrained variables. This is necessary to enable the definition of constraints over the descriptors themselves.

[0110] (a) Counting Constraints: CNTR (X, Y, B)

[0111] Given two constrained variables X and Y, given B a 0/1-constrained variable, and given a relation R defined over Dom(X) x Dom(X)—the Cartesian product of the domains of X and Y, we define the constraint CNT_(R) (X, Y, B) (CNT stands for Counting constraint) by:

[0112] CNT_(R) (X, Y, B) (x, y, b)−1 if

[0113] (x R y) and b=1

[0114] or not(x R y) and b=0

[0115] CNT_(R) (X, Y, B)(x, y, b)=0 otherwise

[0116] The filtering method for the CNT constraints consists of the following rules (every applicable rule is applied):

[0117] [Variable, Demon=> Actions]

[0118] X, value(x)=> {

[0119] If value(B)=1, remove from Dom(Y) every y such that not(x R y)

[0120] If value(B)=0, remove from Dom(Y) every y such that x R y

[0121] If for every y in Dom(Y), x R y holds, set the value of B to 1

[0122] If for every y in Dom(Y), x R y does not hold, set the value of B to 0}

[0123] Y, value(y)=> {

[0124] If value(B)=1, remove from Dom(X) every x such that not(x R y)

[0125] If value(B)=0, remove from Dom(X) every x such that x R y

[0126] If for every x in Dom(X), x R y holds, set the value of B to 1

[0127] If for every x in Dom(X), x R y does not hold, set the value of B to 0}

[0128] B, value(b)=> {

[0129] If b=1

[0130] if value(X)=x, remove from Dom(Y) every y such that not(x R y)

[0131] if value(Y)=y, remove from Dom(X) every x such that not(x R y)

[0132] else (value(B)=0)

[0133] if value(X)=x, remove from Dom(Y) every y such that x R y

[0134] if value(Y)=y, remove from Dom(X) every x such that x R y}

[0135] Other demons do not trigger any action.

[0136] (b) Descriptor Constraints and Descriptor Variables

[0137] Given a constrained variable A and given a function f defined over Dom(A), the descriptor variable f(A) is defined by its domain:

[0138] Dom (f (A))=f (Dom (A))={f(a)|a in Dom (A)}

[0139] Given two constrained variables A and B, and given a function f defined over Dom (A) and taking values in Dom (B), we define the descriptor constraint ATTR_(f)(A, B) by:

[0140] ATTR f (A, B) (a, b)=1 if b=f (a)

[0141] ATTR_(f) (A, B) (a, b)=0 otherwise

[0142] The filtering of descriptor constraints is the following:

[0143] [Variable, Demon=> Actions]

[0144] A, value(a)=> set the value of B to f (a)

[0145] B, value(b)=> remove every a from Dom (a) such that f (a) !=b

[0146] B, remove(b)=>remove every a from Dom (A) such that f (a)=b

[0147] Other demons (e.g., A, remove (a)) do not trigger any action.

[0148] (c) Similarity Constraints

[0149] Given two title variables A and B, and given a natural number N, the similarity constraint S_(N) (A, B) between A and B is defined by:

[0150] S N (A, B) (a, b)=1 if |{i=1 . . . P|not(a.i˜b.i)}|<=N

[0151] S_(N) (A, B) (a, b)=0 otherwise

[0152] Which is equivalent to:

[0153] S N (A, B) (a, b)=1 if |{i=1 . . . P|a.i˜b.i}|>=P−N

[0154] S_(N) (A, B) (a, b)=0 otherwise

[0155] In these formulas, P represents the number of descriptors defined for a title, and “a.i” denotes the i^(th) descriptor of title a.

[0156] To state a similarity constraint, we use descriptor variables (and descriptor constraints) to represent descriptor as constrained variables. We then define counting constraints to represent the number of similarities between descriptors of two titles as constrained variables. Then, we use a linear arithmetic constraint to limit the number of dissimilarities between two successive titles.

[0157] Technically, we define an additional descriptor variable for every descriptor of each title variable. Those descriptor variables are linked to the title variable they come from by an descriptor constraint. We then define a 0/1-variable for each descriptor variable. The 0/1-variable and the corresponding descriptor variable are linked together by a counting constraint. Optionally, we state a linear arithmetic constraint over the 0/1-variables which constrains the number of similarities between descriptors of the two title variables.

[0158] More precisely:

[0159] Let A and B be two title variables with P descriptors

[0160] Let N be a natural number (between 0 and P)

[0161] We state the similarity constraint S_(N) (A, B) as follows:

[0162] For i=1 . . . P, we define Ai (resp. Bi) the descriptor variable of A (resp. B) corresponding to descriptor i; i.e. if style is the third descriptor, A3 is the variable whose domain is the set of styles of titles in the domain of A.

[0163] For i=1, . . . ,P, we state a constraint ATTR(A, Ai) (resp. ATTR(B, Bi)) defined by ATTR(A, Ai) (a, b)=1 iff a.i=b (resp. ATTR(B, Bi) (a, b)=1 iff a.i=b)

[0164] For i=1, . . . ,P, we define a 0/1-variable Ci

[0165] For i=1, . . . ,P, we state CNT_(˜)(Ai, Bi, Ci), the counting constraint for relation˜

[0166] We state a linear arithmetic constraint C1+ . . . +CP>=P−N

[0167] A similarity constraint on two title variables is therefore defined by:

[0168] 2.P descriptor variables (one for each title variable, and for each descriptor);

[0169] 2.P descriptor constraints linking every descriptor variable with the corresponding title variable;

[0170] P 0/1-variables, one for each pair of descriptor variables corresponding to the same descriptor;

[0171] P counting constraints, linking each pair of descriptor variables with one 0/1-variable;

[0172] One linear arithmetic constraint over the 0/1-variables.

[0173] The filtering of similarity constraints is achieved by the filtering of the different descriptor and counting constraints defined . . . (properly speaking, the similarity constraint doesn't exist, it is a collection of additional variables linked together by descriptor and counting constraints)

[0174] (D). Applications of the invention:

[0175] i) Sequences are generated by virtue of using the interface and algorithm described above.

[0176] ii) Iterative fixed-length sequences are generated, in which the user can iteratively apply the scheme to build sequences by refinement, e.g. by selecting, in a sequence computed by the system a title he/she does not want, and by relaunching the execution of the system. 

1. A method of generating sequencing information representing a sequence of items selected in a database, each of the items comprising a set of descriptors, characterised in that it comprises the steps of: a) specifying a length of said sequence and at least one of said descriptors; b) applying similarity relation techniques between said items; and c) generating a fixed-length sequence having a morphological continuity.
 2. The method according to claim 1, wherein each of said items is represented by a series of constraint variables having a domain in the database.
 3. The method according to claim 1, wherein said similarity-relation applying step comprises modelling each of said descriptors in a desired sequence as a constrained variable.
 4. The method according to claim 1, wherein said similarity-relation applying step comprises applying a global similarity relation technique by combining individual similarity measures on all of said descriptors.
 5. The method according to claim 1, wherein said similarity-relation applying step comprises providing mathematical similarity functions.
 6. The method according to claim 1, wherein said similarity-relation applying step comprises providing similarity relations defined by given thresholds.
 7. The method according to claim 1, wherein said sequence-generating step comprises transforming said at least one of said values into unary constraints in terms of constraint satisfaction programming techniques.
 8. The method according to claim 7, wherein said sequence-generating step further comprises subjecting said unary constraints to a processing of variables domain reduction.
 9. The method according to claim 1, wherein said descriptors are expressed in terms of descriptor/value pairs respectively, and each of said values for said descriptor is selected from descriptor/value lists.
 10. The method according to claim 9, wherein each of said descriptors is associated to a descriptor type.
 11. The method according to claim 10, wherein said descriptor type comprises at least one type selected from the group consisting of Integer-Type, Taxonomy-Type and Discrete-Type.
 12. The method according to claim 1, wherein said step of specifying at least one of said values comprises specifying a first title and a last title of said items in said sequence.
 13. The method according to claim 1, wherein said step of specifying at least one of said values comprises specifying a morphological style of said items in said sequence.
 14. The method according to claim 1, wherein said database comprises musical pieces.
 15. The method according to claim 1, wherein said values comprise titles, and said titles form a music program.
 16. A system adapted to implement the method of claim 1, comprising a general-purpose computer and a monitor for display of the generated information.
 17. A computer program product adapted to carry out the method of claim 1, when loaded into a general purpose computer. 